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ABSTRACT 

We recently identified a substantial population of galaxies at z > 2 with comparatively red rest-frame optical 
colors. These distant red galaxies (DRGs) are efficiently selected by the simple observed color criterion 
J s — K s > 2.3. In this paper we present near-infrared spectroscopy with Keck/NIRSPEC of six DRGs with 
previously measured redshifts 2.4 < z < 3.2, two of which were known to host an active nucleus. We detect 
continuum emission and emission lines of all observed galaxies. Equivalent widths of Ha in the non-active 
galaxies are 20 A-30 A, smaller than measured for Lyman break galaxies (LBGs) and nearby luminous infrared 
galaxies, and comparable to normal nearby galaxies. The modest equivalent widths imply that the galaxies either 
have a decreasing star formation rate, or that they are very dusty. Fitting both the photometry and the Ha lines, we 
£T) • find continuum extinction Ay = 1 — 2 mag, ages 1 — 2.5 Gyr, star formation rates 200 — 400 M Q yr _1 , and stellar 

masses 1 — 5 x 10 u Mq for models with constant star formation rates. Models with a declining star formation 
lead to significantly lower extinction, star formation rates, and ages, but similar stellar masses. From [Nil] /Ha 
ratios we infer that the metallicities are high, 1 — 1 .5 x Solar. For four galaxies we can determine line widths from 
the optical emission lines. The widths are high, ranging from 130kms _1 to 240 kms -1 , and by combining data 
for LBGs and DRGs we find significant correlations between linewidth and restframe U — V color, and between 
linewidth and stellar mass. The latter correlation has a similar slope and offset as the "baryonic Tully-Fisher 
relation" for nearby galaxies. From the linewidths and effective radii we infer dynamical masses and mass-to- 
light {M/L) ratios. The median dynamical mass is ~ 2 x 10 n M@, supporting the high stellar masses inferred 
from the photometry. We find that the median M/Ly « 0.8 (M/L)©, a factor of ~ 5 higher than measured for 
LBGs. We infer from our small sample that DRGs are dustier, more metal rich, more massive, and have higher 
ages than z ~ 3 LBGs of the same rest-frame V-band luminosity. Although their volume density is still uncertain, 
their high M/L ratios imply that they contribute significantly to the stellar mass density at z ~ 2.5. As their stellar 
masses are comparable to those of early-type galaxies they may have already assembled most of their final mass. 

Subject headings: cosmology: observations — galaxies: evolution — galaxies: formation 
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C3 ■ 1- INTRODUCTION One of the most direct tests of hierarchical galaxy formation 

L* ' j. - . , , i i . , • models is the predicted decline of the abundance of massive 

It is still not known how and when massive galaxies were , . ™ , ,• , , • , • , , , -,- 

e •• a u e -a ii galaxies with redshift. The best studied galaxies at high redshift 

formed. Early studies envisioned an epoch of rapid collapse & _ T , . rT ... „ TI 

. , ■ , , ue . e „ j , . • j . t ... , ■ , are z ~ 3 Lyman break galaxies (LBGs) (Steidel & Hamilton 

, at high redshift, followed by sustained star formation in a disk . _ . /, _ . . „ 6 IT ., v . \, •, , , 

5-T • .1 i , j. • f t u t n v u t ■ t u 1992; Steidel, Pettim, & Hamilton 1995; Steidel et al. 1996). 

.-rf and a smooth and regular dimming of the stellar light in the . , ' ' ' . . , . ' . . 

u-ji Z i r t a -a ii t> o a Measurements have been made of their clustering properties 

spheroidal component (e.g., Eggen, Lynden-Bell, & Sandage ,. , c . , . . 

iri/co\ iv /r *i u- u- 1 1 c a i (Giavahsco et al. 1998), star formation histories (Papovich, 

1962). More recently, hierarchical galaxy formation models , . _ _ , , _„„, s r ., 

• run i \* « rr^nA/i\ 1 • u » i * a t u » Dickinson, & Ferguson 2001; Shapley et al. 2001), contnbu- 

m Cold Dark Matter (CDM) cosmologies have postulated that . ' . b . , , , , „„, „ . 

, • , , U1 . • ■ tion to the cosmic star formation rate (Madau et al. 1996; Stei- 

massive galaxies have complex assembly histories, and were , , , , „„„ , , „ „ _ . , , „ . , 

, .,. b , „ u a a f * t del et al. 1999; Adelberger & Steidel 2000), rest-frame optical 

built up gradually through mergers and periods of star tor- . . ,. ' . . ° , , . . ... 

/\iru p c i mni\ t ri. a i emission lines (Pettim et al. 2001), and interaction with the 

mation (White & Frenk 1991). In these models properties T _, w ..... v , rl „„_. . . . ' T „ „ , 

, . f ' . , , , ^ IGM (Adelberger et al. 2003). Initially LBGs were thought to 

such as mass, star formation rate, and morphology are tran- , v . & . , , , ' J . , . & , 

• t- a a' i i »l u- ♦ a »u be very massive (Steidel et al. 1996), but these estimates have 

sient, depending largely on the merger history and the time , J . , ,. . , ,. „ 

, , • 4. » t v cc , ln .-. been revised following the realization that their rest-frame UV 

elapsed since the most recent merger (e.g., Kauftmann, White, , . . , ■ , , • , , , ... 

q r< a a ■ mm v rt f i 1 nnn d u * i 1 nno kinematics are dominated by winds rather than gravitational 
& Guiderdom 1993; Kauffmannet al. 1999; Baugh et al. 1998; . _ , ,„__ -L . . , * 

i\/r * i im^ motions (Franx et al. 1997; Pettim et al. 1998). The masses 
Meza et al. 2003). ,, . v _ T , ._,„,/ , n . . 

of luminous z ~ 3 LBGs appear to be ~ 10 M a (Pettim et al. 
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2001; Shapley et al. 2001), a factor of ~ 10 lower than the most 
massive galaxies today. Such relatively low masses are qualita- 
tively consistent with hierarchical models. LBGs in these mod- 
els are "seeds" marking the highest density peaks in the early 
Universe, and form the low mass building blocks of massive 
galaxies in groups and clusters (e.g., Baugh et al. 1998). 

Despite these advances and the qualitative consistency of the- 
ory and observations many uncertainties remain. Perhaps most 
fundamentally, it is still not clear whether LBGs are the (only) 
progenitors of today's massive galaxies. The highly successful 
Lyman break technique selects objects with strong UV emis- 
sion, corresponding to galaxies with high star formation rates 
and a limited amount of obscuration of the stellar continuum. 
Galaxies whose light is dominated by evolved stellar popula- 
tions, and those that are heavily obscured by dust, may there- 
fore be underrepresented in LBG samples (see, e.g., Francis, 
Woodgate, & Danks 1997; Stiavelli et al. 2001; Hall et al. 2001; 
Blain et al. 2002; Franx et al. 2003). 

Recent advances in near-infrared (NIR) capabilities on large 
telescopes have made it possible to select high redshift galax- 
ies in the rest-frame optical rather than the rest-frame UV. The 
rest-frame optical is much less sensitive to dust extinction and is 
expected to be a better tracer of stellar mass. The FIRES project 
(Franx et al. 2003) is the deepest ground-based NIR survey to 
date, with 101.5 hrs of VLT time invested in a single pointing 
on the Hubble Deep Field South (HDF-S) (Labbe et al. 2003), 
and a further 77 hrs on a mosaic of four pointings centered on 
the foreground cluster MS 1054-03 (van Dokkum et al. 2003; 
Forster Schreiber et al. 2004). 

We have selected high redshift galaxies in the FIRES fields 
by their observed NIR colors. The simple criterion J s —K s >2.3 
efficiently isolates galaxies with prominent Balmer- or 4000 A- 
breaks at z > 2 (see Franx et al. 2003). This rest-frame optical 
break selection is complementary to the rest-frame UV Lyman 
break selection. We find large numbers of red z > 2 objects in 
both fields (Franx et al. 2003; van Dokkum et al. 2003). Sur- 
face densities are ~ 3 arcmin~ 2 to K s = 22.5 and ~ 1 arcmin -2 
to K s = 21, and the space density is 30-50% of that of LBGs. 
Their much redder colors suggest they have higher mass-to- 
light (M/L) ratios, and they may contribute equally to the stellar 
mass density. Most of the galaxies are too faint in the rest-frame 
UV to be selected as LBGs. Although the samples are still too 
small for robust measurements, there are indications that the 
population is highly clustered (Daddi et al. 2003; van Dokkum 
et al. 2003; Rottgering et al. 2003). The available evidence sug- 
gests they could be the most massive galaxies at high redshift, 
and progenitors of today's early-type galaxies. 

Although these results are intriguing, large uncertainties re- 
main. The density and clustering measurements are based on 
very small areas, comprising less than five percent of the area 
surveyed for LBGs by Steidel and collaborators. Furthermore, 
owing to their faintness in the observer's optical, spectroscopic 
redshifts have been secured for only a handful of objects (van 
Dokkum et al. 2003). Finally, current estimates of ages, M/L 
ratios, star formation rates, and extinction are solely based on 
modeling of broad band spectral energy distributions (SEDs), 
and as is well known this type of analyis suffers from signifi- 
cant degeneracies in the fitted parameters (see, e.g., Papovich 
et al. 2001; Shapley et al. 2001). 

Confirmation of the high stellar masses and improved con- 
straints on the stellar populations require spectroscopy in the 
rest-frame optical (the observer's NIR). Emission lines such as 



[O III] A4959, 5007 and the Balmer Ha and H/3 lines have been 
studied extensively at low redshift, allowing direct comparisons 
to nearby galaxies. The Ha line is particularly valuable, as its 
luminosity is proportional to the star formation rate (Kennicutt 
1998), and its equivalent width is sensitive to the ratio of current 
and past star formation activity. When more lines are available 
metallicity and reddening can be constrained as well. Finally, 
the widths of rest-frame optical lines better reflect the veloc- 
ity dispersion of the Hll regions than the widths of rest-UV 
lines, which are very sensitive to outflows and supernova-driven 
winds (see, e.g., Pettini et al. 1998). 

In this paper, we present NIR spectroscopy of a small sam- 
ple of 7s — K s selected galaxies. Line luminosities, equivalent 
widths, and linewidths are determined, and the derived con- 
straints on the stellar populations and masses are combined with 
results from fits to the broad band SEDs. Results are compared 
to nearby galaxies, and also to LBGs: Pettini et al. (1998, 2001) 
and Erb et al. (2003) have studied the rest-frame optical spectra 
of LBGs in great detail, providing an excellent benchmark for 
such comparisons. 

For convenience we use the term distant red galaxies, or 
DRGs, for galaxies having J s - K s > 2.3 and redshifts z ;> 2. 
This term is more general than "optical-break galaxies", as it 
allows for the possibility that in some galaxies the red col- 
ors are mainly caused by dust rather than a strong continuum 
break. Im et al. (2002) use the designation Hyper Extremely 
Red Objects, or HEROs, for galaxies with J - K ;> 2. How- 
ever, the corresponding rest-frame optical limits are not re- 
ally "hyper extreme", as they would include all but the bluest 
nearby galaxies: at z = 2.7, our J s — K s limit corresponds to 
U - V ^ 0.1 in the rest-frame. We use Q m = 0.3, Q\ = 0.7, 
and H Q = 70kms _1 Mpc~' (Riess et al. 1998; Spergel et al. 
2003). All magnitudes are on the Vega system; AB conversion 
constants for the ISAAC filters are given in Labbe et al. (2003). 

2. SPECTROSCOPY 
2. 1 . Sample Selection and Observations 

NIR spectroscopy of high redshift galaxies is more efficient 
if the redshifts are already known, as that greatly facilitates the 
search for emission and absorption features. Although "blind" 
NIR spectroscopy of DRGs should be feasible, for this initial 
investigation we chose to limit the sample to galaxies of known 
redshift. By January 2003 (the time of our NIRSPEC obser- 
vations) redshifts had been measured from the rest-frame UV 
spectra of seven DRGs at z > 2. Five are in the field of the 
z = 0.83 cluster MS 1054-03, and are presented in van Dokkum 
et al. (2003). Two are in the Chandra Deep Field South (CDF- 
S); the rest-frame UV spectra of these galaxies are presented in 
the Appendix. Of the seven galaxies, two show rest-frame UV 
lines characteristic of Active Galactic Nuclei (AGN). 

Six of the seven DRGs were observed with NIRSPEC 
(McLean et al. 1998) on Keck II on the nights of 2003 Jan- 
uary 21 - 24. The remaining galaxy was given the lowest 
priority because it is at z = 2.705, which means that Ha and 
H/3 fall outside the NIR atmospheric windows. NIRSPEC was 
used in the medium-dispersion mode, providing a plate scale of 
2.7 Apix~' in the H band and 4.2 Apix~' in the K band. The 
wavelength range offered by the 1024 x 1024 InSb detector is 
thus roughly equal to the width of a single NIR atmospheric 
window. The 0"76 slit provides a resolution of 10 A FWHM in 
H and 14.5 A in K, corresponding to (7j nstr m 80km s _1 in each 
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band. 

The night of January 24 was lost in its entirety due to high 
winds and fog. The weather was variable January 21-23, with 
strong winds toward the end of each night and the night of Jan- 
uary 22 partly cloudy. The seeing was typically 0"8 — I'.'O in 
K. As conditions varied from mediocre to bad during our four- 
night run the data presented here are not representative of the 
full capabilities of NIRSPEC. 

The targets were acquired using blind offsets from nearby 
stars. After an on-target exposure of 900 s the offset was re- 
versed and the star was moved along the slit before offsetting 
again to the object. This method allows continuous monitor- 
ing of slit alignment, which proved to be critical as significant 
misalignments between slit and setup star were sometimes ob- 
served (and corrected). In most cases spectra were obtained at 
three dither positions (at 0", +7", and —7" with respect to cen- 
ter). After each series of dithered observations a nearby bright 
AO star was observed with the same setup to enable correction 
for atmospheric absorption and the detector response function. 

Our procedure is similar to that employed by Pettini et al. 
(2001) and Erb et al. (2003) for LBGs, with one modification: 
after each reversed offset we obtained a spectrum of the align- 
ment star. Although this extra step further increases already 
substantial overheads, it is very useful in the data reduction as 
it provides the spatial coordinate of the (typically very faint) 
object spectrum in each individual 900 s exposure. 

All six galaxies were observed in the NIRSPEC-7 (K) filter. 
For two galaxies we also obtained spectra in the NIRSPEC-4 
band (covering their Balmer/4000 A break regions) and for one 
of these two we obtained a NIRSPEC-5 (H band) spectrum as 
well (covering [O III] and H/3). Overheads were approximately 
a factor of two; the total science integration time was 10 hours. 
Integration times and observed wavelength ranges are listed in 
Table 1. 

2.2. Reduction 

The data reduction used a combination of standard IRAF 
tasks and custom scripts. As a first step the strongest cosmic 
rays were removed, in the following way. The three individ- 
ual exposures were averaged while rejecting the highest value 
at each (x,y) position, and the resulting average sky spectrum 
was subtracted from the raw spectra. The residual spectra are 
dominated by cosmic rays and sky line residuals resulting from 
variations in OH line intensities on ~ 45 min timescales. The 
residual spectra were rotated by 85.4 degrees using the IRAF 
task "rotate" such that the sky line residuals were approxi- 
mately parallel to columns. Rather than linearly interpolating 
between pixels we used "nearest" interpolation, so that each 
pixel in the rotated frame corresponds exactly to a pixel in the 
original frame (with slight redundancy) and cosmic rays retain 
their sharp edges. Sky line residuals were subtracted by fitting a 
third order polynomial along columns. Cosmic rays were iden- 
tified in each of the three residual images with L. A. COSMIC 
(van Dokkum 2001), using a 2D noise model created from the 
subtracted sky spectrum. The cosmic ray frames were rotated 
back to the original orientation and subtracted from the three 
raw spectra. We carefully checked by eye that sky lines, pix- 
els adjacent to cosmic rays, and emission lines remained unaf- 
fected by the procedure. 

Dark current, hot pixels, and a first approximation of the sky 
spectrum were removed by subtracting the average of the two 
other dither positions from each cosmic ray cleaned exposure. 



This procedure increases the noise in each individual image 
according to N « ^Jnj (n — 1 )N OIg , with N org the noise before 
sky subtraction and n the number of independent dither posi- 
tions. The number of dither positions is a trade-off between 
this noise increase and the effects of variations in atmospheric 
OH line strengths, dark current, and thermal background on the 
timescale of the dither sequence. For most of our observations 
n = 3, and the increase in the photon noise is w 22 %. 

The frames and their corresponding sky spectra were rotated 
such that the sky lines are along columns, using a polynomial 
to interpolate between adjacent pixels. The spectra were wave- 
length calibrated by fitting Gaussian profiles to the OH lines 
in the 2D sky spectra. The sky subtracted frames and the sky 
spectra were rectified to a linear wavelength scale. Sky line 
residuals were subtracted by fitting a second order polynomial 
along columns, masking the positions of the object and its two 
negative counterparts in each frame. As the spectra of the setup 
star were reduced in the same way as the science exposures 
these positions are known with good accuracy, even though the 
object spectra are generally barely visible in individual expo- 
sures. 

We extracted two-dimensional 60 pixel (8. "6) wide sections 
centered on the object from each frame. Remaining deviant 
pixels were removed by flagging pixels deviating more than 4er 
from the median of the extracted sections. Finally, the image 
sections were averaged, optimally weighting by the signal-to- 
noise ratio. Signal-to-noise ratios were determined by averag- 
ing the spectra in the wavelength direction, and fitting Gaus- 
sians in the spatial direction. 

2.3. Atmospheric Absorption and Detector Response 

After each 1-2 hr on-target observing sequence we observed 
a nearby AO star with the same instrumental setup. For the 
CDF-S targets SAO 168314 (K = 9.77; J-K= -0.03 ± 0.03) 
was used, and for targets in the field of MS 1054-03 we ob- 
served SAO 137883 (K = 9.28; J - K = 0.01 ± 0.02). The stel- 
lar spectra were reduced in the same way as the galaxy spec- 
tra, and divided by the F\ spectrum of Vega. Residuals from 
Paschen and Brackett absorption lines were removed by inter- 
polation. The resulting spectra consist of the instrumental re- 
sponse function and atmospheric absorption. The galaxy spec- 
tra were divided by these response functions. No absolute cal- 
ibration was attempted at this stage, as both slit losses and the 
variable observing conditions would introduce large uncertain- 
ties. 

2.4. Extraction of ID Spectra 

One-dimensional spectra were extracted by averaging all 
lines in the 2D spectra with average flux > 0.25 x the average 
flux in the central line, using optimal weighting. In addition 
to these averaged "raw" spectra, smoothed and binned spectra 
were created. Smoothed spectra were constructed by smooth- 
ing the raw ID spectra with a 5-pixel (~ 20 A) tophat. Binned 
spectra sample the raw spectra in 30-pixel (~ 120 A) bins, with 
the value in each bin determined with the biweight estimator 
(Beers, Flynn, & Gebhardt 1990). In both cases pixels at loca- 
tions of strong sky emission or strong atmospheric absorption 
were excluded. Extracted Zf-band spectra are shown in Fig. 1. 

3. PHOTOMETRY 

3.1. The MS 1054-03 Field 
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Fig. 1. — NIRSPEC K-b&nA spectra of J, — K s selected galaxies at z > 2. Black thin lines are raw spectra, black thick histograms are binned 
spectra, and smoothed spectra are in red. Grey vertical bands denote areas of strong sky emission or absorption; these regions were excluded in 
the smoothed and binned representations of the data. Emission lines were detected for all galaxies, with typical rest-frame equivalent widths of 
20-30 A. The continuum is also detected in every case. 



The optical and NIR photometry in the MS 1054-03 field are 
described in van Dokkum et al. (2003) and Forster Schreiber 
et al. (2004). Briefly, the dataset is a combination of a mosaic 
of HST WFPC2 images (van Dokkum et al. 2000) and opti- 
cal and NIR data obtained with the VLT as part of the FIRES 
project (Franx et al. 2000). The data reach a 5a limiting depth 
of K s = 23.8 for point sources. The galaxies described in this 
paper have K s < 20; therefore, the uncertainties in the NIR pho- 
tometry are almost entirely systematic. 

The derivation of colors and total K s magnitudes is described 
in Forster Schreiber et al. (2004). The methodology follows 
that of Labbe et al. (2003), who applied identical procedures to 
the FIRES dataset on HDF-S. The observed fluxes required cor- 
rection for the weak lensing effect of the foreground z = 0.83 
cluster. The Hoekstra, Franx, & Kuijken (2000) weak lensing 
mass map, derived from the HST mosaic, was used to calcu- 
late the magnification for each galaxy based on its position and 
redshift. The median lensing amplification is 0.15 magnitudes; 
Table 1 lists the observed and corrected values. We note that the 
foreground cluster aided in the observing efficiency: in its ab- 
sence we would have had to integrate 20-50 % longer to reach 



the same S/N ratios. All quantities given in this paper were 
derived from the de-lensed magnitudes. 



3.2. Chandra Deep Field South 

For CDF-S we used public VLT/ISAAC NIR observations 
from the Great Observatories Origins Deep Survey (GOODS), 
combined with public European Southern Observatory (ESO) 
optical images in the same field. The dataset is described in 
Daddi et al. (2004). The GOODS imaging is slightly less 
deep than the FIRES data in MS 1054-03, but still more than 
adequate for the two relatively bright galaxies discussed here. 
Magnitudes and colors were measured in a similar way as in 
the MS 1054-03 field. The //-band zeropoint is not yet well 
determined at the time of writing, as the data have systematic 
calibration problems at the level of 10-15 %; we took this into 
account by assigning an uncertainty of 0.2 mag to the //-band 
data. Total K s magnitudes for both galaxies are listed in Ta- 
ble 1. Both galaxies have K s < 20, but they fall outside the 
region studied by the K20 survey (Cimatti et al. 2002). 
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4. ANALYSIS 

4. 1 . Continuum Emission and Spectral Breaks 

K-bwA continuum emission is detected of all six observed 
galaxies (Fig. 1). This high detection rate can be contrasted 
with results obtained for LBGs by Pettini et al. (2001), who 
detected continuum emission for only two out of 16 galaxies in 
their primary sample. The reason for this difference is simply 
that the Zf-magnitudes of our spectroscopic sample of J s — K s 
objects are typically much brighter than those of z ~ 3 Lyman 
break galaxies. In fact, their comparitively high luminosity in 
the NIR makes the J s — K s selected galaxies eminently suitable 
for NIR spectroscopy on 8-10m telescopes. The continuum 
detections are very important, as they allow us to determine 
accurate equivalent widths and line luminosities. Had the con- 
tinuum not been detected, absolute calibration of the spectra 
would have been required, leading to typical uncertainties of a 
factor of two (see Pettini et al. 2001). 




3800 4000 4200 4400 



Fig. 2. — Binned spectrum of the brightest galaxy in our sample 
(M-1319 at z = 2.43) in the N4-band (black histogram), along with 
the unbinned 2D spectrum (top). Also shown is the spectrum of a 
nearby AO star (bottom) which was used to correct for atmospheric 
absorption and detector response. The blue line shows the Bruzual & 
Chariot (2003) model which best fits the broad band photometry. This 
model with an age of 2.6 Gyr and extinction Av = 1.3 mag provides a 
remarkably good fit to the observed spectrum. The red line shows a 
model with a very young age and high extinction (Av = 3); this model 
overpredicts the flux blueward of 3900 A (see § 5.1.2). 

The S/N in the continua is not sufficient to detect individ- 
ual absorption lines. However, for the brightest galaxy in our 
sample we detect a significant drop in the continuum in the re- 
gion of the redshifted B aimer/4000 A-break. Figure 2 shows 
the N-4 spectrum of galaxy M-1319 (z = 2.423). The N4- 
band lies between the J and H bands, and suffers from strong 
atmospheric absorption. The continuum on the red side of 
the absorption trough is stronger than the continuum on the 
blue side, indicating a drop across the Balmer/4000 A-break. 
The biweight mean over the wavelength range 1.28 — 1.33/im 
(3739 - 3885 A in the rest-frame) is 0.46 ± 0.04 (arbitrary 



units), compared to 0.69 ± 0.03 over the wavelength range 
1.43 - 1.57 fim (4177 -4587 A in the rest-frame), a drop of 
a factor of w 1.5. 

We conclude that the J s — K s criterion isolated a galaxy with a 
continuum break, as it was designed to do. As will be discussed 
in § 5. 1 .2 the S/N is just sufficient to provide a constraint on the 
dust content. Moreover, this detection in a relatively short expo- 
sure time highlights the potential of present-day telescopes for 
quantitative continuum studies of galaxies with evolved stellar 
populations at z > 2. 

4.2. Emission Lines 

4.2.1. Detections 

Emission lines were detected for all observed galaxies. Two 
galaxies were known to host AGNs; both show strong and fairly 
broad emission lines. Three of the remaining galaxies show nar- 
row lines typical of normal star forming galaxies, and the fourth 
shows a broad line probably indicating an active nucleus. Each 
of the six observed galaxies is briefly discussed below. 
M-1035 - This object was identified as a Type-II AGN based 
on its narrow Lya, C IV and He II lines in the rest-frame UV 
(van Dokkum et al. 2003). We observed the galaxy in the 
N-4, N-5 (H), and N-7 (K) bands. The continuum was de- 
tected in every band as well as several emission lines. All lines 
have widths < 2000km s _1 , confirming the Type-II classifica- 
tion (see, e.g., Stern et al. 2002; Norman et al. 2002). The 
[Olll] lines have width a = 231 km s" 1 (see §4.2.3), and the 
broadest feature is the Ha + [N II] complex, width a combined 
FWHM of 1 600 km s ~~ 1 . The object is a faint hard X-ray source 
(Johnson, Best, & Almaini 2003), consistent with the idea that 
the central engine is (partially) obscured by dust. 
M-1356 - The rest-frame UV spectrum of M-1356 indicates 
the presence of an AGN (van Dokkum et al. 2003), with char- 
acteristics of Broad Absorption Line QSOs. The UV spec- 
trum shows strong Nitrogen lines indicating a high metallic - 
ity. The K-bwd spectrum shows fairly broad Ha + [N II] emis- 
sion with FWHM ~ 2500 km s -1 . This galaxy is not a Chan- 
dra source, consistent with the BAL nature of its active nucleus 
(e.g., Green & Mathur 1996). We note that due to a revision 
of our photometry since the NIRSPEC observations this galaxy 
has J s — K s = 2.13, and hence falls blueward of our DRG crite- 
rion; removing this galaxy from the analysis does not alter the 
conclusions in any way. 

M-1383 - For galaxy M-1383 the rest-frame optical spec- 
troscopy implies a revision of the rest-frame UV redshift. We 
deduced a redshift z = 3.525 from faint absorption lines in the 
rest-frame UV (van Dokkum et al. 2003); however, in the K- 
band we detect Ha and [N II] redshifted to z — 2.423, making 
this the fourth red galaxy at z ~ 2.43 in the MS 1054-03 field. 
The revised redshift is consistent with the photometric redshift 
in van Dokkum et al. (2003). The rest-UV spectrum has low 
S/N, and the previously published redshift may simply be in er- 
ror. Another possibility is that the blue feature seen off-center 
in this object is a background galaxy at z = 3.52. 
MS-1319 - The brightest galaxy in the sample in the /f-band, 
enabling the direct detection of the continuum break responsi- 
ble for its red J s — K s color (§ 4. 1). The ^T-band spectrum shows 
Ha and [N II]. 

C-237 - The spectrum of C-237 shows a broad, low S/N 
Ha + [N II] complex. Higher S/N spectra are needed to mea- 
sure the width reliably, but we infer that the galaxy probably 
hosts an active nucleus. The association of C-237 with a hard 
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Fig. 3.— Gaussian fits to emission lines. For M-1035 and C-3396 the lines are [O III] 4959,5007 A; forM-1383 and M-1319 they are Ha and 
[N II] A6584. Raw spectra are shown in black; red lines are best fitting models. Line widths are also shown, corrected for instrumental broadening. 
The line widths are substantial, typical of massive nearby galaxies. 



X-ray source (Alexander et al. 2003) is consistent with this in- 
terpretation. 

C-3396 - The redshifted [O III] lines are detected at the ex- 
pected locations; H(3 is barely detected. The lines are resolved 
in the spatial direction, and there is some evidence for a velocity 
gradient. The galaxy is a faint soft source in the 1 Ms Chandra 
catalog (Giacconi et al. 2002), possibly indicating a very strong 
star burst. The X-ray properties of DRGs will be discussed in 
K. Rubin et al., in preparation. 

4.2.2. Line Luminosities and Equivalent Widths 

For each galaxy we modeled the continuum with a second 
order polynomial, fitted to the binned spectrum. Emission lines 
were masked in the fit. After subtracting the continuum, line 
fluxes were measured by summing the counts in the residual 
spectrum in regions centered on the redshifted lines. The width 
of these regions was typically 50 A, except for two cases where 
Ha is broader (see §4.2.1). Errors were derived from simula- 
tions, by placing random apertures in regions with similar noise 
characteristics and by varying the parameters of the continuum 
fit. 

Calibration of the line fluxes was performed in the follow- 
ing way. First the emission lines were added to the continuum 
fits; we used the fits rather than the measured continuum to 
limit the effects of sky line residuals. Next, the spectra were 
converted from F\ to F v and convolved with the VLT ISAAC 
broad band filter response curves appropriate for the observed 



wavelength range. Finally, the spectra were normalized using 
the measured magnitudes in these filters. The broad band pho- 
tometry is accurate to w 0.05 mag. Total magnitudes were used, 
so that the measured line fluxes are effectively corrected to an 
aperture covering the entire galaxy. Equivalent widths were de- 
termined directly by comparing the line flux to the continuum 
fit at the location of the line, and require no absolute calibration. 
The measured line properties are listed in Table 2. 

4.2.3. Line Widths 

We determined line widths by fitting Gaussian models to the 
ID spectra. Galaxies C-237 and M-1356 were excluded, as for 
both galaxies the Ha + [Nil] line complex is too broad to sep- 
arate the two lines with confidence. Galaxy M-1035 also has 
a broad Ha line complex, but for this object we can obtain a 
reliable line width from its [O III] lines. 

For each of the four galaxies two lines were detected (see Ta- 
ble 2); both lines were fitted simultaneously. Free parameters in 
the fits are the redshift, fluxes of both lines, and the line width. 
The continuum was allowed to vary within the uncertainties of 
the polynomical fit to the binned spectrum (see §4.2.2). The 
fitting procedure minimizes the sum of the absolute differences 
between model and data to limit the effects of outliers; mini- 
mizing the rms gives similar results. Pixels in regions of strong 
sky lines were excluded from the fit. 

Best fitting models are shown in Fig. 3; line widths corrected 
for instrumental broadening are listed in Table 2. Errors were 
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Fig. 4. — Spectral energy distributions of J s — K s selected galaxies with spectroscopic redshifts z > 2. Large open circles show observed fluxes, 
and solid circles show fluxes corrected for emission line contributions. Thin lines show best fitting stellar population synthesis models with 
constant star formation rate and varying age and dust content. Small open circles are model fluxes in the broad band filters. The median age is 
1.6 Gyr and the median absorption A v = 1.8 mag. For comparison the lower right panel shows the median best fitting model to z = 3 Lyman break 
galaxies from Shapley et al. (2001), with a much younger age of 0.3 Gyr and absorption A v = 0.5 mag. 



determined from fits to simulated spectra. One-dimensional 
noise spectra were extracted from the 2D spectra at random spa- 
tial positions and added to model spectra; these artifical spec- 
tra have very similar noise characteristics as the galaxy spec- 
tra. The errors were determined from the biweight width of 
the distribution of differences between true and measured line 
widths. The line width of galaxy C-3396 has a large (asym- 
metric) uncertainty; although the faint [O III] lines are resolved, 
their width is not well determined. 

5. STELLAR POPULATIONS 

5.1. Fits to Broad Band Photometry 

For all galaxies high quality photometry is available over the 
wavelength range 0.35 — 2.2 /x, and following previous studies 
of LBGs we fit model SEDs to the photometric data to constrain 
star formation rates, ages, and the extinction of the stellar con- 
tinua. For completeness we include galaxy M-140 in this anal- 
ysis, as it is the only DRG with redshift that was not observed 



with NIRSPEC, and its redshift implies that the NIR photome- 
try will only be marginally affected by emission lines. We note 
that similar fits were presented in van Dokkum et al. (2003) 
for the five galaxies in the MS 1054-03 field. Fits to the entire 
sample of DRGs in MS 1054-03 (i.e., including those without 
spectroscopic redshift) are presented in Forster Schreiber et al. 
(2004). 

5.1.1. Corrections for Emission Lines 

Strong emission lines can affect the broad band fluxes of high 
redshift galaxies and may cause systematic errors in derived 
stellar population parameters. In Lyman break galaxies emis- 
sion lines typically contribute ~ 15 % to the broad band fluxes 
(Shapley et al. 2001). This estimate is somewhat uncertain be- 
cause it relies on absolute calibration of the NIR spectroscopy 
(see Pettini et al. 2001). The corrections can be determined 
with greater confidence for the present sample of DRGs be- 
cause the continuum is detected in every case. 

Emission line corrections for all observed photometric bands 
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Fig. 5. — Results of Bruzual & Chariot (2003) model fits to the emission-line corrected broad-band SEDs. Large symbols show the best fitting 
values of age and star formation rate. Small symbols show the range in best fit parameters for each galaxy, derived from randomizing the photom- 
etry within the (random and systematic) photometric errors. Dashed lines are of constant mass. The ages and star formation rates have substantial 
uncertainties, but the stellar masses are well constrained. They are very high at 1 — 5 x 10" M Q . 



are listed in Table 3. Values in parentheses have been calcu- 
lated with the help of the following relations, determined from 
the Jansen et al. (2000) sample of nearby galaxies: 

W A ([OII]) = 0.98±0.52 x W x (Ha) (1) 

W x (UP+[0 III]) =0.53 ±0.22 x W\ (Ha) (2) 

W A ([OH]) = 1.8 ±0.4 x W A (A5007). (3) 

The corrections range from < 0.05 to 0. 16 magnitudes in the K s 
band, with a median of 0.08. Our sample is biased toward the 
presence of emission lines in the rest frame UV (as this makes 
it easier to measure the redshift), and it is reasonable to assume 
that it is also biased toward the presence of lines in the rest 
frame optical. Therefore, the small contributions measured here 
strongly suggest that the red colors of J s — K s selected galaxies 
in general are caused by red continua rather than the presence 
of strong emission lines in the K s band. 

5.1.2. Fitting Methodology and Results 

Emission-line corrected SEDs are shown in Fig. 4 (solid 
symbols), along with the uncorrected photometry (open sym- 
bols). We fitted stellar population synthesis models to the cor- 
rected photometry. The Bruzual & Chariot (2003) models were 
used; this latest version is based on a new library of stellar spec- 
tra and has updated prescriptions for AGB stars. 

A major systematic uncertainty in this type of analysis is the 
parameterization of the star formation history (Shapley et al. 
2001; Papovich et al. 2001). The ubiquitous emission line de- 
tections in our sample imply that the galaxies are not evolving 
passively, and that models in which all stars were formed in a 
single burst at high redshift are not appropriate. For simplic- 
ity we limit the discussion to models with constant star for- 
mation rates, characterized by three parameters: the time since 
the onset of star formation (the age), the star formation rate, 
and the extinction. Models with a declining star formation rate 



are discussed briefly in §5.3, and in Forster Schreiber et al. 
(2004). We only consider solar metallicity models; as we show 
in § 5.6 such models are consistent with the limited information 
we have on the abundances of DRGs. 

The publicly available HYPERZ photometric redshift code 
(Bolzonella, Miralles, & Pello 2000) was used for the fitting. 
Its template library was updated with the Bruzual & Chariot 
(2003) models and the redshift was held fixed at the spectro- 
scopic value. The Calzetti et al. (2000) reddening law was ap- 
plied; we tested that the results are not very sensitive to the as- 
sumed extinction curve. A minimum photometric error of 0.05 
was assumed, so that individual data points with very small for- 
mal errors do not dominate the x 2 minimalization. 

Best fitting SEDs are overplotted in Fig. 4. Corresponding 
star formation rates, ages and reddening values are listed in Ta- 
ble 4. The constant star formation models provide reasonable 
descriptions of the photometric data. The median absolute dif- 
ference between data and model is w 0.07 mag for galaxies C- 
3396, M-140, M-1356 and M-1383, and w 0.2 mag for the 
other three galaxies. However, as the errors in the photometry 
are typically much smaller, the \ 2 minima are mostly greater 
than one, with median 2.9. 

Confidence limits on the derived parameters were determined 
in the following way. For each galaxy, 500 simulated SED dat- 
apoints were created by randomizing the photometry within the 
uncertainties. In order to account for systematic errors and tem- 
plate mismatch the uncertainties were multiplied by the square 
root of the \ 2 value of the best fitting model. The simulated 
SEDs were modeled using the same procedure as the observed 
SEDs. Figure 5 shows the best fit age and star formation rate 
for each galaxy (large symbols) along with values for 100 of 
the simulated SEDs (small symbols). Uncertainties in the ages 
are highly correlated with those in the star formation rates. As 
stellar mass is the product of star formation and age in these 
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models the uncertainty in this parameter is relatively small. 

The lcr confidence limits listed in Table 4 were determined 
from the simulations by calculating the widths of the distribu- 
tions of age, star formation rate, reddening, and stellar mass. 
The confidence limits are generally not symmetric. For all 
galaxies "maximally old" fits fall within the confidence limits, 
i.e., in every case the upper limit to the age is set by the age of 
the universe at the epoch of observation. The uncertainties are 
largest for galaxy M-1383, which has a smooth SED without a 
pronounced break. 

The stellar populations in these galaxies appear to be quite 
similar. All seven have relatively old ages ranging from 0.7 - 
2.6 Gyr, and substantial extinction ranging from 0.9 - 2.3 mag- 
nitudes in the rest frame V band. The implied star formation 
rates range from 1O2M yr~' to 468 M yr _1 , and the stellar 
masses from 1.3 x lO n M to 4.9 x 1O U M . We note that the 
parameters for galaxies M-1035 and M-1356 are uncertain, as 
their active nuclei may affect their continua. 

5.2. Constraints from Ha 

The SED fitting results suggest that the DRGs are unlike Ly- 
man break galaxies: their star formation rates are similar, but 
their ages and dust content are substantially larger. The implied 
stellar masses rival those of elliptical galaxies in the nearby 
Universe. In light of these important implications and the sys- 
tematic uncertainties associated with SED fitting it is critical to 
compare the results to those implied by our NIR spectroscopy. 

5.2.1. Star Formation Rates and Ages 

The Ha emission line measures the current star formation 
rate (through the line luminosity) as well as the ratio of current 
and past star formation (through the equivalent width). Dust- 
corrected line luminosities can be converted to the instanta- 
neous star formation rate using 



SFR(M yr" 1 ) =7.9 x 10" 42 L Hct (ergss" 



(4) 



(Kennicutt 1998). In local galaxies this relation is consistent 
with other indicators of the star formation rate, such as radio lu- 
minosity and far-infrared luminosity (e.g., Kewley et al. 2002). 
Such studies have not yet been done systematically at high red- 
shift, although results for LBGs suggest that this correspon- 
dence holds for these galaxies as well (e.g., Reddy & Steidel 
2004). 

For constant star formation histories the equivalent width has 
a one-to-one relation with the time elapsed since the onset of 
star formation (the age). The dependence of age on equivalent 
width has the form of a powerlaw. Using the Bruzual & Char- 
lot (2003) models to find the continuum luminosity and Eq. 4 
to convert line luminosity to star formation rate we find 

log(age) = 13.2-2.41ogW A (Ha) (5) 

with W\ the rest frame equivalent width in A and the age in 
years. Over the range 7.5 < log (age) < 9.5 the powerlaw ap- 
proximation holds to a few percent. Note that the relation de- 
pends on the conversion factor between star formation rate and 
line luminosity (Eq. 4), but not on the star formation rate itself. 

The main uncertainty is attenuation by dust. Ha luminosi- 
ties are usually corrected for extinction by comparing them 
to H/3, assuming the intrinsic ratio of case-B recombination. 
However, as we have not measured the Ha/H/3 pair in any of 
the galaxies, we have to derive the extinction from the attenu- 
ation of the stellar continuum. Studies of nearby and distant 
galaxies indicate that the extinction toward Hll regions is at 



least as high as that of the stellar continuum (e.g., Kennicutt, 
1992, 1998; Calzetti et al. 1996; Erb et al. 2003). The re- 
lation depends on the distribution of the dust, which is diffi- 
cult to constrain observationally. In the following, we consider 
two cases: 1) no additional extinction toward Hll regions. In 
this case, L(Ha) corr = 10 ' 37Al, L(Ha) obs (with A v the extinc- 
tion of the stellar continuum), and W\(Ha) con — WA(Ha) bs; 
and 2) additional V-band extinction of one magnitude. In 
that case, L(Ha) corr = 10 a37 ( Al '+ 1 )L(Ha) obs , and W x (Ua) cm = 
2.34WA(Ha) b s . These two cases probably bracket the actual 
values. 

5.2.2. Consistency with SED fits 

We first consider whether the ages, star formation rates, and 
extinction derived from the SED fits are consistent with the ob- 
served luminosities and equivalent widths of Ha. Values im- 
plied by the SED fits were calculated using Eqs. 4 and 5, and the 
star formation rates and ages listed in Table 4. Extinction cor- 
rections were applied following the procedure outlined above. 
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In Fig. 7(a) the line luminosities implied by the SED fits are 
compared to the observed ones. Galaxies M-1035 and M-1356 
are not shown, as their emission lines are almost certainly domi- 
nated by their active nuclei. For C-3396 we use Lu a = 2.75Lh/3 
(Osterbrock 1989), after taking differential extinction into ac- 
count. The SED results overpredict the Ha observations for 
A^hii =^t/ sed, but they agree very well if Hll regions suffer 
additional extinction of approximately one magnitude. 

In Fig. 7(b) predicted equivalent widths are compared to the 
observed ones. The value for C-3396 was determined using 
Wx(H/3) = 0.22W x (Ua) (Jansen et al. 2000). Again, the data 
are in good agreement and indicate additional extinction of one 
magnitude toward Hll regions. 

5.2.3. Constraints on Ay 

Although the agreement between SED fits and Ha measure- 
ments is encouraging the two constraints are not truly indepen- 
dent, as they both rely on the extinction as derived from the 
broad band photometry. Furthermore, the additional extinction 
toward Hll regions is a free parameter in the modeling. Here 
we discuss the dependence of the age and star formation rate 
on the dust content in more detail. We only consider the two 
non-active galaxies for which Ha has been reliably measured 



(M-1383 andM-1319). 

We first determine the dependence of the SED fitting results 
on the extinction. Rather than having Ay as a free parameter 
in the fits it was held fixed at values ranging from Ay = to 
Ay — 3; only the age and star formation rate were allowed to 
vary. As shown in Figure 6 (blue lines) the dependencies are 
quite strong, because the age and star formation rate are very 
sensitive to the UV flux, which is a strong function of Ay. 

Next we examine the extinction-dependence of the con- 
straints derived from Ha (red lines). The solid lines show the 
derived star formation rates and ages assuming Av hii = A^sed- 
Importantly, the implied ages are higher than the age of the Uni- 
verse at z = 2.43 with this assumption, in particular for galaxy 
M-1319. The low Ha equivalent widths thus imply either addi- 
tional absorption toward Hll regions, or a declining star forma- 
tion rate (see § 5.3). Note that this result follows directly from 
Eq. 5, and is independent from the SED fits. Dashed lines show 
the effect of an additional magnitude of extinction for the line 
emitting gas; with this assumption the implied ages are lower 
than the age of the Universe. 

Furthermore, the dependencies are much weaker than for 
the SED fits, because they are determined by the absorp- 
tion at 6563 A rather than in the UV. As an example, for 
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galaxy M-1383 the Ha-derived star formation rate varies from 
25M Q yr~ 1 to 300M Q yr~' for A v =0 — 3, whereas the star 
formation rate derived from the SED fits varies between zero 
and 2OOOM yr~ 1 . The different dependencies on the extinc- 
tion can be utilized: we can determine Ay by requiring that 
the ages and star formation rates derived from the SED fits are 
equal to those derived from Ha. Consistent solutions are ob- 
tained at the intersections of the lines derived from the SED 
fits and those derived from Ha. We find 0.8 < Ay < 1.2 for 
M-1319 and 1.8 ^A v < 2.6 for M-1383, if the" additional ex- 
tinction toward Hll regions is approximately one magnitude. 

This derivation of the extinction is independent of the values 
corresponding to the best fitting SED: there is no "built-in" re- 
quirement that the solutions with the lowest x 2 residual should 
fall in the regions of the plots where the SED constraints and 
the Ha constraints are consistent with each other. Nevertheless, 
they do: the large solid symbols correspond to the best fitting 
SEDs (shown in Fig. 4); as in Fig. 5 small dots indicate the un- 
certainty. These results considerably strengthen the conclusion 
from §5.2.2: the constraints on ages and star formation rates 
from Ha are consistent with those derived from the broad band 
photometry only for the values of Ay corresponding to the best 
fitting SEDs. Furthermore, the additional extinction toward Hll 
regions (which was treated as a free parameter in § 5.2.2) is re- 
quired by the strict upper limit on the ages imposed by the age 
of the Universe at z = 2.43. 

5.3. Models with Declining Star Formation Rates 

The inferred star formation rates are very sensitive to the 
(high) extinction that we derive. We can determine the "mini- 
mum" star formation rates for galaxies M-1319 and M-1383 
by applying no extinction correction to their Ha luminosi- 
ties. The uncorrected star formation rates are 23 M yr _1 and 
24 Mq yr _1 respectively, i.e., an order of magnitude lower than 
the extinction-corrected values (see Table 5). As we demon- 
strated in the preceeding Sections, the SEDs of the galaxies, 
and the Ha equivalent widths, cannot be fitted with continuous 
star formation models with low extinction. Here we investigate 
what values we find for the extinction and star formation rates 
if we allow for the possibility that the star formation rates are 
not constant but declining. We only consider the two non-active 
galaxies with measured Ha luminosities; a more extensive dis- 
cussion is given in Forster Schreiber et al. (2004), which de- 
scribes SED fits to the full sample of galaxies with K s < 21.7 
and J s - K s > 2.3 in the MS 1054-03 field. 

Galaxies M-1319 and M-1383 were fitted with Bruzual & 
Chariot (2003) models characterized by a star formation history 
with an e-folding time r = 300 Myr. Both galaxies are well fit- 
ted by this model; differences in the best fitting x 2 between the 
declining and the continuous model are not statistically signif- 
icant. For both galaxies the r-model gives lower dust content, 
lower ages, and lower star formation rates than the continu- 
ous model; values are listed in Table 5. However, the mean 
star formation rate over the lifetime of the galaxy is actually 
slightly higher in the r-model: 507M Q yr _1 for M-1383 and 
330M Q yr~' for M-1319. As a result, the stellar masses in the 
two models are similar. These effects are illustrated in Fig. 8, 
which shows the star formation histories of both galaxies for 
both models. The effect of the r-model is to make the star 
formation history more concentrated and peaked in time, but it 
does not substantially change the total number of stars formed. 

We conclude that the extinction and star formation rates are 
model-dependent, and can be lower by factors of 2 — 3 if the star 



formation histories are modified. However, the stellar masses 
are robust, because declining models have a much higher star 
formation rate at earlier epochs (see Fig. 8). 
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Fig. 8. — Best fitting star formation histories of galaxies M-1383 
(red) and M-1319 (blue), for a continuous star formation model 
(dashed) and a r-model with an e-folding time of 0.3 Gyr (solid). 
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mation rates at the epoch of observation, but they have much higher 
star formation rates at earlier epochs. 



5.4. Continuum Break 

For one galaxy (M-1319) we obtained a spectrum in the rest- 
frame wavelength range 3750 A - 4550 A (see §4.1), providing 
an additional constraint on its stellar population. The spectrum 
can be used to disentangle the effects of age and dust. Young 
dusty galaxies have smooth SEDs, whereas galaxies with stellar 
populations older than ~ 500 Myr have a pronounced disconti- 
nuity at A3625A (the Balmer discontinuity) or A4000A (the 
Can H+K break). 

The spectrum is of insufficient quality to constrain the age 
and dust content independently. Instead, we generated model 
spectra for different values of Ay by fitting constant star forma- 
tion models to the broad band photometry (see §5.2.3). This 
constrained fit ensures that the best fitting model is consistent 
with the broad band photometry (or at least as close as possi- 
ble for that particular value of Ay). The models were binned in 
the same way as the observed spectrum, and fitted using a x 2 
minimalization. 

The best fitting Ay = 0.6^ consistent with the extinction 
derived from the broad band photometry (Ay = 1 .3 ± 0. 1) and 
from the comparison of SED fits and Ha constraints (Ay = 
0.8— 1.2). The uncertainties are quite large, partly because 
the spectrum does not extend blueward of the Balmer break. 
Models with extinction higher than Ay 2 fit poorly because 
they overpredict the flux blueward of 3900 A and underpredict 
the flux redward of 4400 A; an example with Ay = 3 is shown 
by the red histogram in Fig. 2. The best fit to the broad band 
photometry is shown in blue; this model is a good match to the 
observed spectrum. 
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5.5. SCUBA Detection 

Independent constraints on the star formation rate are very 
important as they can help break the degeneracy between the 
form of the star formation history and the current star forma- 
tion rate (see § 5.3). K. Knudsen et al. (in preparation) obtained 
a deep, large area SCUBA map of the MS 1054-03 field, and 
we have performed an initial investigation of coincidences of 
SCUBA detections and J s — K s selected galaxies. Interestingly 
galaxy M-1383, the object in our sample with the highest in- 
ferred dust content, is very likely a SCUBA source. 

The object is detected with an 850 /i flux density of 5.1 ± 
1.5 mJy. Assuming the infrared SED is similar to that of 
Arp220 the IR luminosity is 5 x 10 12 L Q , and the implied star 
formation rate is ~ 500M Q yr _I , in excellent agreement with 
the (extinction-corrected) value of ~ 400 M Q yr~' that we de- 
termine from the stellar SED and Ha in the constant star forma- 
tion model (see Table 5). We note that such a good agreement 
is uncharacteristic of local ultra-luminous infrared galaxies, as 
star forming regions in these galaxies usually suffer from such 
high visual extinction that it is very difficult or impossible to 
correct for it. A full analysis of submm and X-ray constraints on 
the star formation rates of DRGs will be presented elsewhere; 
here we note that the submm data for this galaxy provide strong 
independent support for our modeling procedures. 

5.6. Metallicities 

Determining metallicities for high redshift galaxies is diffi- 
cult, as generally only a limited number of (bright) emission 
lines is available. The most widely used metallicity indicator 
for faint galaxies is the R23 index, which relates O/H to the rel- 
ative intensities of [Oil] A3727, [Olll] AA4959,5007, and H/3 
(Pagel et al. 1979; Kobulnicky, Kennicutt, & Pizagno 1999). 
Using this indicator Pettini et al. (2001) derive abundances for 
five LBGs of between 0. 1 x and 1 x Solar, with the main uncer- 
tainty the double-valued nature of the R23 index. 

We do not have the required measurements to calculate 
the R23 index for any of the four non-active galaxies in our 
sample. However, for two objects the luminosities of Ha 
and [Nil] A6584 are available, enabling us to derive metal- 
licities using the "N2" index. This indicator is defined as 
N2= log ([N II] A6584/ Ha); it was first suggested by Storchi- 
Bergmann, Calzetti, & Kinney (1994) and discussed recently 
by Denicolo, Terlevich, & Terlevich (2002). Advantages of the 
N2 indicator are that it has a single-valued dependence on the 
oxygen abundance and that it is insensitive to reddening. How- 
ever, it is sensitive to ionization and, as oxygen is not measured 
directly, to O/N variations (see, e.g., Denicolo et al. 2002). 

For objects M-l 383 and M-l 3 19 we find N2 = -0.40 ± 0. 16 
and N2 = —0.10 ±0.13 respectively. Using the calibration of 
Denicolo et al. (2002), 

12 + log(0/H) =9.12(±0.05)+0.73(±0.10) xN2, (6) 
we find 12 + log(0/H) = 8.8 ± 0.3 and 9.0 ± 0.3 respec- 
tively (where we used an intrinsic scatter in the method of 
0.2 dex; see Denicolo et al. 2002). As the Solar abundance 
12 + log(O/H) = 8.83, we infer that the abundances of the 
two J s — K s selected galaxies are 1 x and 1 .5 x Solar, with un- 
certainties of a factor of two. We conclude that current evidence 
suggests that red z > 2 galaxies are already quite metal rich, as 
might have been expected from their high dust content and ex- 
tended star formation histories. 

6. KINEMATICS 



6.1. Masses and M/L Ratios 

The stellar masses implied by the SED fits are substantial, 
ranging from 1.3 -4.9 x 10 n M Q , with median 3.5 x 10 n M Q . 
Although these results are supported by the Ha luminosities 
and equivalent widths significant systematic uncertainties re- 
main: the SED fitting is dependent on the assumed IMF and 
metallicity, and both methods are sensitive to the assumed star 
formation history and extinction law. 

Independent constraints on the masses of the red galaxies 
can be obtained from their kinematics. Line widths were mea- 
sured for four galaxies, one of which harbors an AGN (see 
§ 4.2.3). The widths are substantial, and range from 134kms~ 1 
to 240km s _1 . The line broadening can be caused by regular 
rotation in a disk, random motions, or non-gravitational effects. 
Rix et al. (1997) derive an empirically motivated transforma- 
tion from observed line width to rotation velocity in a disk; ap- 
plying their relation a w 0.6K- gives rotation velocities in the 
range 200^-OOkms -1 . The kinematics are thus in qualitative 
agreement with the high masses inferred for the stars. 

The large linewidths imply that the four red galaxies are very 
massive, unless they are highly compact. The galaxies are faint 
in the rest-frame UV which makes it difficult to measure their 
sizes from optical HST images. Fig. 9 shows K s images of the 
four galaxies with measured velocity dispersions, all taken with 
ISAAC on the VLT in w 0."5 seeing. All four galaxies are re- 
solved in these ground-based images. Effective radii are ~ 0"5, 
with the largest galaxy (M-1383) having r e « 0."7 (Trujillo et 
al. 2004 and in preparation). Using r e w 0"5 as a reasonable es- 
timate for the size, the median a k, 200kms~ 1 for the velocity 
dispersion, and the same normalization as Pettini et al. (2001), 
we find Md yn ~2x 1 1 1 M© . This number is in good agreement 
with the stellar mass estimates. 




Fig. 9. — Ground based K s images of the four galaxies with measured 
line widths, obtained with ISAAC on the VLT. Images are 5" x 5"; 
the FWHM image quality is w 0"5. All four galaxies are resolved; 
they have typical effective radii of ~ 0."5. 

The median absolute V magnitude of the four galaxies is 
-23.6 (see below). With M~2x lQ n M Q this implies dy- 
namical mass-to-light ratios M/L v ~ 0.8 in Solar units, a fac- 
tor of ~ 5 higher than z ~ 3 Lyman-break galaxies (Pettini et al. 
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Fig. 10. — Correlations of rest-frame optical linewidth with rest-frame K-band absolute magnitude (a), rest-frame U — V color (b), and stellar 
mass (c). Small open circles are Lyman-break galaxies at z ~ 3, the large open symbol is a LBG at z = 2. 19, and filled circles are J s — K s selected 
galaxies with mean z = 2.6. Linewidth correlates with luminosity, color, and stellar mass in the combined sample. The line in (c) is the relation 
between rotation velocity and baryonic mass determined for local galaxies by McGaugh et al. (2002), using a « 0.614 (Rix et al. 1997). This 
relation provides an excellent fit to the z > 2 galaxies. 



2001). Based on the range of values within our sample we es- 
timate that the uncertainty in this number is approximately a 
factor of two. 

6.2. Correlations with Linewidth 

Pettini et al. (2001) studied Tully-Fisher (Tully & Fisher 
1977) like correlations for LBGs, by comparing their kine- 
matics (as measured by the line width of rest-frame opti- 
cal emission lines) to their rest-frame UV and optical magni- 
tudes. No correlations were found over the range 50kms~' < 
a < 115kms _1 . Here we add the four red z > 2 galaxies with 
measured linewidths to Pettini's sample and re-assess the exis- 
tence of correlations with linewidth at z ~ 3. 

Rest-frame V magnitudes were determined from observed 
K s magnitudes and J s — K s colors. For each galaxy we derive 
a transformation between observed magnitudes and rest-frame 
magnitudes of the form 

V rest = K s + a(J s -K s )+/3. (7) 

The constants a and (3 depend on the redshift, but only weakly 
on the spectral energy distribution of the galaxy (see van 
Dokkum & Franx 1996). Using spectral energy distributions 
from Bolzonella et al. (2000) we determined that, for z ~ 3, 
the transformations have the same form within ~ 5 % for SEDs 
ranging from starburst to early-type. Shapley et al. (2001) give 
7T s -band magnitudes for nine galaxies in the Pettini et al. (2001) 
sample, and J - K s colors for six. For the three LBGs that have 
not been observed in J we use synthetic J — K s colors; the added 
uncertainty in V rest is negligible asa= —0.01 for z = 3. 

Figure 10(a) shows the "Tully-Fisher" diagram of rest-frame 
optical linewidth (corrected for instrumental resolution) versus 
rest-frame absolute V magnitude, for z > 2 galaxies. The four 
J s — K s selected galaxies have higher linewidths and are on av- 
erage more luminous than z ~ 3 LBGs. Although visually sug- 
gestive, the Spearman rank and Pearson correlation tests give 
probabilities of ~ 50 % that log a and My are uncorrected, and 
we conclude the evidence for the existence of a luminosity- 
linewidth relation at z ~ 3 is not yet conclusive. Erb et al. 
(2003) find that z « 2 LBGs have on average higher linewidths 
than those that have been observed by Pettini et al. (2001) at 



z « 3, and it will be interesting to see where these galaxies lie 
on the Tully-Fisher diagram. The only z ~ 2 LBG from Erb 
et al. (2003) with measured K magnitude (Q1700-BX691 at 
z = 2.19) is indicated with a large open symbol in Fig. 10(a). 

In Fig. 10(b) linewidth is plotted versus rest-frame U — V 
color, which were derived from the observed J s — K s colors 
in a similar way as the rest-frame V magnitudes. Only the 
six Lyman-break galaxies with measured J — K s colors are 
included. There is a striking correlation, with significance 
> 97%. The figure also clearly demonstrates the large differ- 
ence in rest-frame colors between LBGs and J s — K s selected 
galaxies. In the local universe similar trends exist, with more 
massive galaxies generally being redder. 

Finally, Fig. 10(c) shows the relation between linewidth and 
stellar mass. To limit systematic effects we determined the stel- 
lar masses of the LBGs with our own methods, using the pho- 
tometry presented in Shapley et al. (2001) and Steidel et al. 
(2003). The median stellar mass is 2 x 10 10 M Q , very similar 
to what Shapley et al. (2001) finds for the same nine objects. 
The correlation has a significance of w 90 %. The line shows 
the "baryonic Tully-Fisher relation" of McGaugh et al. (2000), 
which is an empirical relation between rotation velocity and to- 
tal mass in stars and gas (for galaxies with M* > 10 9 M Q this is 
approximately equal to the stellar mass alone). With a w 0.6V C 
this relation takes the formM* = 312a 4 M & in our cosmology; 
without any further scaling it provides a remarkably good fit to 
the z > 2 data. 

We note that the analysis may be affected by selection ef- 
fects, as we combine very different samples of galaxies in Fig. 
10. Pettini's LBGs are at slightly higher mean redshift than 
the DRGs, and their linewidths were measured from the [O III] 
lines instead of Ha. The z ~ 2 LBG sample of Erb et al. (2003) 
shows a much larger range in linewidth than Pettini's z ~ 3 
sample. This may be caused by differences in selection (the 
Pettini sample was selected to be very bright in the rest-frame 
UV, which may select relatively low mass objects; see Erb et 
al. 2003), but may also hint at systematic differences between 
linewidths determined from Ha and the oxygen lines, or even 
evolution. It will be interesting to see where the Erb et al. ob- 



14 



NIR spectroscopy of red galaxies 



jects fall in the panels of Fig. 10, especially those with the high- 
est linewidths. 

We are also limited by small number statistics. In particu- 
lar, the reason why linewidth appears to correlate most strongly 
with color may simply be that there is no /-band measurement 
given by Shapley et al. (2001) for West MMD11, which is a 
luminous and apparently massive Scuba source with very small 
linewidth (a = 53 ±5 km s -1 ). When this object is arbitrarily 
removed in Figs. 10(a) and (c) as well the correlation signifi- 
cance jumps to w 98 % in both cases. 

6.3. Outflows 

A major result of kinematic studies of LBGs is that they ex- 
hibit galactic-scale outflows (Franx et al. 1997; Pettini et al. 
1998, 2000; Adelberger et al. 2003). The main evidence comes 
from comparisons of the radial velocities of Lya, UV absorp- 
tion lines, and rest-frame optical emission lines. The UV ab- 
sorption lines are blueshifted with respect to the optical lines 
as they trace the gas flowing towards us, whereas Lya emission 
appears redshifted as it originates from the back of the shell of 
outflowing material (see, e.g., Pettini et al. 2001 for details). 
In LBGs winds have typical velocities of ~ 300kms~' (with 
a large range), and are thought to play an important role in en- 



riching their surrounding medium (Adelberger et al. 2003). 

In our sample of DRGs galactic winds may be less efficient in 
enriching the IGM; as the galaxies have very high masses the 
mechanical energy provided by supernovae and stellar winds 
may not be sufficient to let the gas escape. We can constrain the 
wind velocities for only two galaxies in our sample: C-3396, 
for which we have measured Lya, UV absorption lines, and the 
[Olll] lines, and M-1319, with Lya and Ha. In C-3396 the 
implied wind velocity is <~ 300 km s ~ 1 , similar to typical values 
for LBGs. In M-1319 it is only ~ lOOkms -1 . Given the large 
range observed for LBGs, much larger samples are needed to 
reliably compare the wind velocities of DRGs and LBGs, and 
to determine whether the wind velocities of DRGs exceed their 
escape velocities. 

7. COMPARISON TO OTHER GALAXIES 

The J s — K s selection technique is one among many ways 
to select distant galaxies. Examples of other galaxy popula- 
tions beyond z ~ 2 are Lyman break galaxies (Steidel et al. 
1996), Lya emitters (Hu, Cowie, & McMahon 1998), hard X- 
ray sources (e.g., Barger et al. 2001), and submm sources (see, 
e.g., Blain et al. 2002). With the growing diversity of the high 
redshift "zoo" it becomes increasingly important to determine 
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Fig. 1 1 . — Correlations of Ha equivalent width (a,b) and the [N II] A6584/ Ha ratio (c,d) with rest-frame U — V color and V-band absolute mag- 
nitude, for samples of star-forming galaxies. Shown are nearby normal galaxies from the Nearby Field Galaxy Survey (Jansen et al. 2000,2001) 
(small dots), nearby Luminous InfraRed Galaxies from Arhus et al. (1989) (crosses), LBGs from Pettini et al. (2001) (open circles), and the three 
non-active DRGs with measured Balmer lines from the present study (large solid circles). The metallicity scale on the right axis of (c) and (d) 
corresponds to the LBGs from Pettini et al. (2001). This scale is coupled to the [Nil] / Ha scale through the Denicolo et al. (2002) calibration. No 
extinction corrections were made. The broken line in (b) indicates a line of constant Ha luminosity; the dashed line in (c) and (d) indicates Solar 
metallicity. Contrary to LBGs, the red z > 2 galaxies have similar colors, metallicities, and Ha equivalent widhts as normal nearby galaxies. 
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the relations between these populations, with the aim of estab- 
lishing plausible evolutionary histories for nearby galaxies. 

7.1. Comparison to Star Forming Galaxies at z = — 3 

Star formation in local and distant galaxies usually takes 
place in one of two modes: low intensity, prolonged star for- 
mation in a disk (most nearby galaxies), or high intensity, rela- 
tively short lived star formation (star burst galaxies, LIRGs and 
ULIRGs, and probably also Lyman break galaxies). The DRGs 
studied in this paper seem to have different star formation histo- 
ries: high intensity star formation that is maintained for a long 
time (up to several Gyr). 

These trends are illustrated in Fig. 11, which shows cor- 
relations between emission line properties, color, and abso- 
lute magnitude for DRGs, LBGs, normal nearby galaxies from 
Jansen et al. (2000), and Luminous InfraRed Galaxies (LIRGS) 
from Armus, Heckman, & Miley (1989). In (a) we show the 
relation between Ha equivalent width and rest-frame U — V 
color. For C-3396 and the LBGs the equivalent widths were de- 
termined using the empirical relation W\(Hf3) = 0.22W.x(Ha) 
(Jansen et al. 2000). The equivalent width is a measure of 
the ratio of present to past star formation, and therefore a good 
indicator of the time that has elapsed since the onset of star for- 
mation. Lyman break galaxies have high values of Wa(Hq;), 
similar to the much redder LIRGS, consistent with the burst- 
like, short-lived nature of their star formation. The colors of 
DRGs are similar to those of LIRGS, but their equivalent widths 
are similar to those of normal nearby galaxies, indicating pro- 
longed star formation histories. As can be seen in Fig. 11(b) 
DRGs, LBGs, and LIRGS all have very high luminosities com- 
pared to normal galaxies. The slope of the dotted line indicates 
constant Ha luminosity; the Ha luminosities for the samples of 
LBGs, DRGs, and LIRGS are similar. 

In Fig. 11(c) we show the relation between the [Nil] /Ha 
ratio and color. Local galaxies show a strong relation in this 
diagram, a reflection of the fact that more metal rich galaxies 
are redder. The two DRGs fall on the relation defined by local 
galaxies, and have similar or slightly lower metallicities than 
LIRGs. The DRGs also fall on the local relation of [N II] / Ha 
ratio and luminosity (Fig. 1 Id). For LBGs the [N II] /Ha index 
has not been measured due to the fact that Ha falls outside the 
K window. The open circles in (c) and (d) were taken from Pet- 
tini et al. (2003), and correspond to the right axis. The relation 
between the left and right axes is that of Denicolo et al. (2002). 
The two DRGs have higher metallicities than most LBGs, con- 
sistent with their redder colors and older ages. 

We conclude that the DRGs fall on the local relations be- 
tween Ha equivalent width and color, metallicity and color, and 
metallicity and luminosity. They have similar colors as LIRGS 
but lower WA(Ha), indicating more prolonged star formation. 
LBGs do not fall on the relations defined by normal nearby 
galaxies, as they have a low metallicity for their luminosity and 
are much bluer (see also, e.g., Pettini et al. 2001). 

7.2. Comparison to Early-type Galaxies 

It seems difficult to avoid the conclusion that the seven galax- 
ies studied here are very massive. Three lines of evidence imply 
masses J> 10 n M Q : stellar population synthesis model fits to the 
UBVRIJHK spectral energy distributions (all seven galaxies); 
the luminosity and equivalent width of Ha (two galaxies); and 
the line widths combined with approximate sizes (four galax- 

10 Their definition of elliptical galaxies includes many SO galaxies. 



ies). 

Given the high masses, red rest-frame optical colors, and ap- 
parent clustering (Daddi et al. 2003; van Dokkum et al. 2003) 
of J s — K s selected galaxies we can speculate that some or all 
of their descendants are early-type galaxies. It is therefore in- 
teresting to compare the masses and M/L ratios to those of 
early-type galaxies at lower redshift. Such comparisons are 
fraught with uncertainties, and given the small size of our sam- 
ple should be regarded as tentative explorations. 

In Fig. 12 we compare the stellar masses (derived from the 
SEDs) of the seven galaxies in our sample to the stellar mass 
distribution of elliptical 10 galaxies in the Sloan Digital Sky Sur- 
vey (SDSS), as compiled by Padmanabhan et al. (2003). The 
SDSS sample includes both field and cluster galaxies. The dis- 
tribution of the J s — K s selected galaxies is similar to that of 
local elliptical galaxies. In particular, if we require that their 
present-day masses should not exceed those of the most mas- 
sive galaxies observed in the local universe, their stellar masses 
cannot grow by much more than a factor of 2-3 after (z) = 2.6. 
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Fig. 12. — Comparison of the stellar masses (derived from SEDs) of 
the seven J s — K s selected galaxies discussed in the present paper to 
those of > 29, 000 elliptical galaxies in the Sloan Digital Sky Survey 
(Patmanabhan et al. 2003). The red z > 2 galaxies have similar stellar 
masses as massive elliptical galaxies in the nearby universe. 

Comparisons of luminosities or masses are very sensitive to 
selection effects. These effects are less pronounced when com- 
paring ratios of mass and luminosity. In Fig. 13 we compare 
the dynamical M/L ratios of the red z > 2 galaxies to those 
of nearby early-type galaxies (Faber et al. 1989; Jorgensen et 
al. 1996), early-type galaxies at < z < 1.3 (van Dokkum & 
Stanford 2003; van Dokkum & Ellis 2003), and Lyman-break 
galaxies (Pettini et al. 2001). As most of the previous work was 
done in rest-frame B, we transformed the rest-frame M/Ly ra- 
tios of the J s —K s selected galaxies to M /Lb using the observed 
J s -K s colors (see Eq. 7); we find M/L B kM/L v w0.8 in Solar 
units. Based on the range of M/L ratios within our sample we 
estimate that the uncertainty in this number is approximately 
a factor of two. The high redshift data were tied to the work 
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on early-type galaxies in the following way. For the cluster 
MS 1054-03 we determined rest-frame M/L ratios and masses 
in the same way as for the z > 2 galaxies, using their effective 
radii, velocity dispersions, and total rest-frame B magnitudes. 
The M/L ratios correlate with the mass. We fitted a powerlaw 
to this correlation, and determined the M/L B ratio correspond- 
ing to the median mass of the z > 2 DRGs. The MS 1054- 
03 data were tied to the other cluster- and field galaxies using 
the offsets derived from the Fundamental Plane (see, e.g., van 
Dokkum et al. 2001). We note that this procedure gives M/L 
ratios at z = which are similar to the average M/L ratios mea- 
sured for nearby elliptical galaxies: van der Marel (1991) finds 
M/L B w 7(M/L) , and our method gives M/L B w8(M/L) at 
z = 0. 
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Fig. 13. — Dynamical mass-to-light ratios of early-type galaxies at 
< z < 1.3 and galaxies at z > 2. The solid points probably rep- 
resent the most massive galaxies in the universe at any epoch. The 
line is a fit to the redshift evolution of cluster galaxies, and has the 
form logM/Lg = 0.89 — 0.43z. The extrapolation to higher redshifts 
is (obviously) still very uncertain. 

The M/L ratios of the red z > 2 galaxies are lower by a factor 
of ~ 3 than those of the highest redshift early-type galaxies yet 
measured. They are consistent with a simple empirical extrap- 
olation: the line is a fit to the cluster galaxies of the form 

\ogM/L B = 0.89 - 0.43z. (8) 

It is tempting to interpret this figure in terms of simple passive 
evolution. However, progenitor bias (van Dokkum & Franx 
2001) and other selection effects would undermine the value 
of such an analysis, at least beyond z = 1.3. Furthermore, the 
models presented here suggest that the DRGs are not passively 
evolving, but still forming new stars. Finally, dust corrections 
have not been applied. Rather than interpreting the figure in 
terms of stellar evolution, the solid points can be viewed as the 
observed change in mass-to-light ratio of massive galaxies as 
a function of redshift. Galaxies with M/L ratios higher than 
the broken line may be very rare, and the empirical Eq. 8 can 
provide feducial upper limits to the masses of distant galaxies 
when only their luminosities are known. 



8. CONCLUSIONS 

Near-infrared spectroscopy with Keck/NIRSPEC has pro- 
vided new constraints on the nature of red galaxies at redshifts 
z > 2. The results are not unique, as they depend on the as- 
sumed functional form of star formation history, and on the (un- 
known) extinction towards the line emitting gas. The low Ha 
equivalent widths imply either significant extinction or a de- 
clining star formation rate. For constant star formation models 
we find total extinction toward Hll regions of Ay = 2 — 3 mag, 
extinction-corrected star formation rates of 200-400 M Q yr -1 , 
ages 1 — 2.5 Gyr, and stellar masses 1 — 5 x 10 n M Q . In declin- 
ing models the star formation rates and extinction are lower but 
the stellar masses are similar. Measurements of Ha and H/3 for 
the same objects will help constrain the extinction toward the 
line-emitting gas. 

The J s — K s selected objects extend the known range of prop- 
erties of early galaxies to include higher masses, mass-to-light 
ratios, and ages allowing a re-evaluation of the existence of 
galaxy scaling relations in the early universe. By combining 
our small sample of DRGs with a published sample of LBGs 
we find significant correlations between line width and color 
and line width and stellar mass. The latter correlation is in- 
truiging, as it has the same form as the "baryonic Tully-Fisher 
relation" in the local universe. There are, however, many un- 
certainties. In particular, it is unclear whether the line widths at 
z > 2 measure regular rotation in disks, and if they do, whether 
they sample the rotation curves out to the same radii as in local 
galaxies. Also, the correlations are currently only marginally 
significant, and larger samples are needed to confirm them. Fi- 
nally, the comparison to z ~ 3 LBGs may not be appropriate: 
initial results from studies at z ~ 2 indicate that their measured 
properties are markedly different from the z « 3 LBGs, with 
the lower redshift galaxies being more massive and more metal 
rich (Erb et al. 2003 and private communication). A plausible 
reason is that the z ~ 3 LBGs were selected to be bright in the 
rest-frame ultraviolet, which may select lower mass and lower 
metallicity objects (Erb et al. 2003). It will be interesting to see 
if the z « 2 LBGs deviate from the relations seen in Fig. 10. 

It is still unclear how DRGs fit in the general picture of 
galaxy formation. They are redder, more metal rich, and more 
massive than z~3 Lyman break galaxies of the same luminos- 
ity, which suggests they will evolve into more massive galax- 
ies. A plausible scenario is that they are descendants of LBGs 
at z ~ 5 or beyond, in which continued star formation led to a 
gradual build-up of dust, metals, and stellar mass. It is tempting 
to place DRGs in an evolutionary sequence linking them to Ex- 
tremely Red Objects (EROs) and massive ^—selected galaxies 
at 1 < z < 2 (e.g., Elston, Rieke, & Rieke 1988; McCarthy et al. 
2001; Cimatti et al. 2002; Glazebrook et al. 2004) and early- 
type galaxies in the local universe. The apparently smooth se- 
quence of massive galaxies in the M/L versus redshift plane is 
suggestive of such evolution. However, it is not clear whether 
galaxies follow "parallel tracks" in such plots (i.e., DRG — > 
ERO — ► early-type, and LBG — ► spiral) or evolve in complex 
ways due to mergers or later addition of metal-poor gas (e.g., 
Trager et al. 2000; van Dokkum & Franx 2001 ; Bell et al. 2004). 

The subsequent evolution of DRGs should also be placed in 
context with the evolution of the cosmic stellar mass density. 
The red galaxies likely contribute ^ 50 % of the stellar mass 
density at z « 2.5 (Franx et al. 2003; Rudnick et al. 2003), 
a similar contribution as that of elliptical galaxies to the local 
Stellas mass density (e.g., Fukugita, Hogan, & Peebles 1998). 
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However, studies of the Hubble Deep Fields suggest that the 
stellar mass density of the Universe has increased by a factor of 
~ 10 since z ~ 3 (Dickinson et al. 2003; Rudnick et al. 2003), 
and it is unknown whether DRGs participated in this rapid evo- 
lution. 

The high dust content that we infer for DRGs has impor- 
tant consequences for tests of galaxy formation models that 
use the abundance of /f-selected galaxies at high redshift (e.g., 
Kauffmann & Chariot 1998; Cimatti et al. 2003). Our results 
show that any observed decline in the abundance will at least 
in part be due to dust: the most luminous K— selected galaxies 
at < z < 1 are dust-free elliptical galaxies, but the K magni- 
tudes of red galaxies at z > 2 are probably attenuated by 1-2 
magnitudes. Therefore, tests of galaxy formation models using 
the "K20" technique require a careful treatment of dust, either 
by correcting the observed abundance for extinction or by in- 
corporating dust in the models. 

There are several ways to establish the properties of red z > 2 
galaxies more firmly, and to enlarge the sample for improved 
statistics. Photometric surveys over larger areas and multiple 
lines of sight are needed to better quantify the surface density, 
clustering, and to measure the luminosity function. Further- 
more, the high dust content and star formation rates imply that a 
significant fraction of DRGs may shine brightly in the submm, 
as evidenced by the SCUBA detection of the galaxy with the 
highest dust content in our sample. The star formation may 
also be detectable with Chandra in stacked exposures, as has 
been demonstrated recently for LBGs (Reddy & Steidel 2004). 
Chandra can also provide estimates of the prevalence of AGN 
in DRGs; this fraction could be relatively high if black hole ac- 
cretion rates correlate with stellar mass at early epochs. Spitzer 
can provide better constraints on the stellar masses, as it sam- 



ples the rest-frame K band. 

Finally, the results in this paper can clearly be improved by 
obtaining more NIR spectroscopy of red z > 2 galaxies. Con- 
trary to the situation for LBGs, the sample of DRGs with mea- 
sured redshift is rather limited due to their faintness in the 
rest-frame UV and the resulting difficulty of measuring red- 
shifts in the observer's optical (see van Dokkum et al. 2003; 
Wuyts et al. in preperation). Furthermore, samples of galaxies 
with optically-measured redshifts are obviously biased toward 
galaxies with high star formation rates, low dust content, and/or 
active nuclei; it may not be a coincidence that the galaxy with 
the highest dust content is the only one in our sample without 
Lya emission. "Blind" NIR spectroscopy is notoriously diffi- 
cult, but may be the best way of determining the masses, star 
formation rates, and dust content of DRGs in an unbiased fash- 
ion. It is expected that such studies will be possible for large 
samples when multi-object NIR spectrographs become avail- 
able on 8-10m class telescopes. 
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APPENDIX 
OPTICAL SPECTROSCOPY IN CDF-S 

We obtained rest-frame UV spectroscopy of J s — K s selected galaxies in the Chandra Deep Field South (CDF-S) prior to our 
NIRSPEC observations. The spectra were obtained with the Focal Reducer/Low Dispersion Spectrograph 2 (FORS2) on the Very 
Large Telescope (VLT) 2002 December 5-6. Galaxies with J s — K s > 2.3 and K s < 22 were selected using NIR photometry obtained 
from the public GOODS survey (see §3.2). The 300V grism was used, giving an approximate wavelength range of 0.3-1.0/x. 
Conditions were photometric, and the seeing was « 1"5. After each individual 1800 s exposure the objects were moved along the 
slit to facilitate sky subtraction. A bright blue star was included in each of the masks to enable correction for atmospheric absorption. 
The reduction followed standard procedures for dithered multi-slit spectroscopic data (see, e.g., van Dokkum & Stanford 2003). An 
initial inspection of the reduced data in 2002 December provided two redshifts for J s — K s selected objects; as these galaxies were 
subsequently observed with NIRSPEC their spectra are presented here. The full photometric and spectroscopic datasets of J s — K s 
selected galaxies in CDF-S will be presented in Wuyts et al., in preparation. 

The two spectra are shown in Fig. A14. Galaxy C-237 has a strong emission line at 4354 A. The line is identified as Lya redshifted 
to z = 2.5808, as no other emission lines are seen in the optical window. The spectrum of galaxy C-3396 is qualitatively similar to 
spectra of LBGs (see, e.g., Steidel et al. 1996), and the galaxy has an unambiguous redshift z = 3.25. The two strongest absorption 
lines are blueshifted with respect to Lya, as is common in LBGs (Shapley et al. 2003). We find Zhya = 3.2538, zsin = 3.2462, and 
Zen = 3.2458, each with an error of w 0.0005. 
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Fig. A14. — Rest frame UV spectra of two galaxies with J s — K s > 2.3 in CDF-S. Grey bands indicate locations of strong atmospheric emission 
lines. Galaxy C-237 has a weak continuum and one strong emission line which we interpret as Ly-a at z = 2.5808. Galaxy C-3396 has Ly-a in 
emission and shows interstellar absorption lines typical of star forming galaxies. 



ARE LBGS AND DRGS ORTHOGONAL POPULATIONS? 

The fact that the majority of J s — K s > 2.3 galaxies are fainter than R = 25.5 in the rest-frame ultraviolet provides an upper limit to 
the number of galaxies that may fall in both categories. Franx et al. (2003) and van Dokkum et al. (2003) find that more than 70% 
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or the red galaxies are fainter than this limit, and would therefore not be present in spectroscopic samples. Here we examine whether 
the most luminous DRGs (which are the subject of this study) would be selected by the Lyman break technique. 

The redshift information in addition to the high quality SEDs allow us to determine whether the bright galaxies in our sample 
would be selected as Lyman-break galaxies. Synthetic colors in the Steidel & Hamilton (1992) filter system are determined from the 
best fitting Bruzual & Chariot (2003) models, including extinction and absorption due to the Lyman limit and the Lya forest (Madau 
et al. 1996). In Fig. B15 we show the locations of the seven galaxies in the U„ — G versus G — 7Z plane. None of the seven DRGs with 
spectroscopic redshift fall in the "C+D" selection region; two are just inside the wider "M+MD" region (see Steidel et al. 2003). 




0.5 1 1.5 2 

G - 31 



Fig. B15. — Location of the J s —K s selected galaxies on the Lyman-break selection diagram. Solid symbols with errorbars show the rest-UV 
colors of DRGs at their observed redshift. Tracks and open symbols show the effect of varying the redshift over the interval 2 < z < 4. Solid lines 
delineate the primary selection region ("C+D") employed by Steidel and collaborators; the broken line indicates their more generous "M+MD" 
criteria (see Steidel et al. 2003). The J s — K s objects have red rest-frame UV spectra due to dust, and would only be selected as LBGs in a very 
small redshift range, or not at all. 

This result partly reflects the narrow redshift range that the Steidel & Hamilton filters are tuned to: Lyman-break galaxies at z = 2.4 
would require a "bluer" filter set than U„G1Z, or a different selection window in the U„ — G vs. G — 1Z diagram (see Erb et al. 2003). 
Rather than blueshifting the filters we artificially redshift the galaxy spectra over the range 2 < z < 4, properly accounting for the 
changing intergalactic absorption. As can be seen in Fig. B15 three of the seven galaxies would fall in the primary selection area if 
they were at z ~ 3. The G — 7Z colors of the remaining four are too red to fall in the primary selection area for any redshift; however, 
three are just inside the wider "M+MD" area for z ~ 2.7. 

Even though six out of seven galaxies could be selected as "C+D" or "M+MD" Lyman breaks for certain redshifts, the 
probability that they enter spectroscopic LBG samples is low. For a random distribution in redshift, star forming galaxies with 
E(B — V) = 0.0 — 0.3 are ^ 5 times more likely to fall in the selection box than the dusty DRGs (see Fig. 2 in Steidel et al. 2003). 
Based on our sample of seven we conclude that even the small fraction of red galaxies which has R < 25.5 is underrepresented in 
spectroscopic LBG samples, because of their red rest-frame UV colors. 
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TABLE 1 

Galaxy Sample 



Id 


vD03" 


zuv 






1 — K 




\ ( 1 IXTl\ 


1V1 1 \> JJ 


1195 


2.424 


19.57 


19.91 


2.25 


3,600 


? nSR - ? 47S 














1,800 


1.508 - 1.778 














2,700 


1.266- 1.540 


M-1356 


1458 


2.427 


19.67 


19.82 


2.13 


3,600 


2.058 - 2.480 


M-1383 


1656 


2.423 rf 


19.55 


19.77 


2.83 


5,400 


1.993-2.410 


M-1319 


1671 


2.423 


19.01 


19.15 


2.58 


5,400 


2.050 - 2.470 














3,600 


1.282- 1.558 


C-237 




2.581 


20.00 




2.63 


5,400 


2.056 - 2.480 


M-140 


184 


2.705 


19.76 


19.87 


2.68 






C-3396 




3.254 


19.66 




2.56 


4,500 


1.980-2.400 



" Id number from van Dokkum et al. (2003). 
* Total K s magnitude on Vega system. 

c K s magnitude corrected for the lensing effect of the foreground cluster. 
d Revised redshift from rest-frame optical spectroscopy (see § 4.2.1). 



TABLE 2 

Emission Lines 



Id 


£opt 


Line 


f 


W" 


L c 


a d 


M-1035 


2.424 


Hf3 


2.9 ±2.1 


28 ±20 


1.4±1.0 








[O m] 4959 A 


13±4 


116±42 


5.9± 1.8 


231 ±26 






[O III] 5007 A 


31 ±7 


275 ± 86 


14.3 ±3.2 


231 ±26 






Ha + [Nil] 


17±4 


114±32 


8.1 ± 1.9 


< 1600 


M-1356 


2.430 


Ha + [N II] 


19±4 


104 ±23 


8.7±2.0 


< 2500 


M-1383 


2.423 


Ha 


6.5 ±1.7 


31 ±9 


3.0±0.8 


153 ±25 






[Nil] 


2.6 ±0.9 


13±4 


1.2±0.4 


153 ±25 


M-1319 


2.424 


Ha 


6.3 ±1.4 


19±4 


2.9 ±0.7 


240 ±56 






[Nil] 


5.0±1.2 


15 ±3 


2.3 ±0.5 


240 ±56 


C-237 


2.581 


Ha + [N II] 


7.9 ±2.4 


46± 11 


4.3± 1.3 


<, iooo 


C-3396 


3.249 


up 


2 5+ 2 ' 5 


7 +7 
'-4 


2 4+ 2 4 








[0 III] 4959 A 


4.5 ±2.1 


12±5 


4.3 ±2.0 


134±S 






[0 ill] 5007 A 


10±3 


26±8 


9.5 ±2.9 


1341S 



" Line flux in units of 10 _ 17 ergs s~ 1 cm 
* Rest-frame equivalent width in A. 
c Line luminosity in units of 10 42 ergs s~ 
d Line width in km s~ 1 . 



TABLE 3 
Emisson Line Corrections" 



Id 




U 


B 


V 


Vf,06 


hi4 


z 


Js 


H 


K s 


M- 


-1035 





1.2±0.3 


0.56 ±0.20 


0.39±0.10 


<0.10 




0.11 ±0.10 


0.31 ±0.06 


0.16 ±0.03 


M- 


-1356 





0.12 ±0.02 


0.09 ±0.02 


0.02 ±0.02 


<0.05 




(0.07 ±0.03) 


(0.24 ±0.10) 


0.16 ±0.02 


M- 


-1383 





<0.05 


<0.05 


<0.05 


<0.05 




(0.06 ±0.03) 


(0.03 ±0.02) 


0.08 ±0.02 


M- 


-1319 





0.10±0.02 


<0.05 


<0.05 


<0.05 




(0.03 ±0.03) 


(0.02 ±0.02) 


0.04 ±0.02 


C- 


237 





0.77 ±0.20 


<0.05 


< 0.05* 


< 0.05 c 


(<0.05) 


(<0.1) 


(<0.1) 





M- 


-140 





0.26 ±0.03 


< 0.05 


< 0.05 


< 0.05 




(0.05 ±0.05) 


(< 0.05) 


(< 0.05) 


C 


-3396 








0.06 ±0.02 


< 0.05* 


< 0.05" 


(<0.05) 


(< 0.05) 


(0.08 ±0.07) 


0.07 ±0.05 



" Units are magnitudes; values in parentheses were not measured but calculated based on line ratios of nearby galaxies. 
* FORS R filter. 
c FORS / filter. 
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TABLE 4 
Results From SED Fits 



Id logr A v logSFR logM, 

yr mag Mq yr~ 1 Mq 



M- 


-1035 


8 9 +UJ 


L - y -0.2 


2 50 +Uu4 


11 


2g4-o.iv 

zo -0.13 


M 


-1356 


q 9+0.2 


9+° 1 


2 oi +0 10 


11 


, O+0.09 
1 -'-0.12 


M- 


-1383 


9 +0 4 


9 a+0.5 
z - J -0.3 


2 64+° 36 


11 


cc+0.14 
J -0.21 


M- 


-1319 


94+0 


1 3 +al 
L - J -o.i 


9 qq+0.09 
z --' y -0.04 


11 


O8 -0.08 


C- 


237 


94+0 




2 40+ 016 

^•^"-0.08 


11 


ftQ+0.08 
uy -0.17 


M- 


-140 


8 9+ 02 




2 67+° 08 

z - u/ -0.09 


11 


4 3+0.14 


C- 


3396 


9 2 +al 




9 jrr+0.21 
^•^ -0.13 


11 


60 +0.07 
uu -0.34 



TABLE 5 

Star Formation Rates in Different Models 





Continuous (r 


= oo) 


Declining 


(r = 300Myr) 




M-1383 M 


-1319 


M-1383 


M-1319 


Age (SED) (Gyr) a 


1.0 


2.6 


0.7 


0.7 


A v (SED)" 


2.3 


1.3 


1.8 


1.0 


SFR (SED) a 


439 


243 


122 


106 


SFR(Ha;A H n = 0)* 


24 


23 


24 


23 


SFR (Ha; Ann = Av) h 


169 


69 


110 


54 


SFR^QiAHi^Av + l)* 


393 


161 


257 


125 



Ages, star formation rates and extinction derived from fits to broad band spectral energy distributions. 
Star formation rates derived from Ha, for different assumptions for the extinction toward H II regions. 



